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Abstract — We consider the problem of sensor selection for 
time-optimal detection of a hypothesis. We consider a group of 
sensors transmitting their observations to a fusion center. The 
fusion center considers the output of only one randomly chosen 
sensor at the time, and performs a sequential hypothesis test. We 
consider the class of sequential tests which are easy to implement, 
asymptotically optimal, and computationally amenable. For three 
distinct performance metrics, we show that, for a generic set of 
sensors and binary hypothesis, the fusion center needs to consider 
at most two sensors. We also show that for the case of multiple 
hypothesis, the optimal policy needs at most as many sensors to 
be observed as the number of underlying hypotheses. 

Index Terms — Sensor selection, decision making, SPRT, 
MSPRT, sequential hypothesis testing, linear-fractional program- 
ming. 

I. INTRODUCTION 

In today's information-rich world, different sources are 
best informers about different topics. If the topic under con- 
sideration is well known beforehand, then one chooses the 
best source. Otherwise, it is not obvious what source or 
how many sources one should observe. This need to identify 
sensors (information sources) to be observed in decision 
making problems is found in many common situations, e.g., 
when deciding which news channel to follow. When a person 
decides what information source to follow, she relies in general 
upon her experience, i.e., one knows through experience what 
combination of news channels to follow. 

In engineering applications, a reliable decision on the un- 
derlying hypothesis is made through repeated measurements. 
Given infinitely many observations, decision making can be 
performed accurately. Given a cost associated to each obser- 
vation, a well-known tradeoff arises between accuracy and 
number of iterations. Various sequential hypothesis tests have 
been proposed to detect the underlying hypothesis within a 
given degree of accuracy. There exist two different classes 
of sequential tests. The first class includes sequential tests 
developed from the dynamic programming point of view. 
These tests are optimal and, in general, difficult to imple- 
ment |5lj. The second class consists of easily-implementable 
and asymptotically-optimal sequential tests; a widely-studied 
example is the Sequential Probability Ratio Test (SPRT) 
for binary hypothesis testing and its extension, the Multi- 
hypothesis Sequential Probability Ratio Test (MSPRT). 
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In this paper, we consider the problem of quickest decision 
making using sequential probability ratio tests. Recent ad- 
vances in cognitive psychology J7) show that the performance 
of a human performing decision making tasks, such as "two- 
alternative forced choice tasks," is well modeled by a drift 
diffusion process, i.e., the continuous-time version of SPRT. 
Roughly speaking, modeling decision making as an SPRT 
process is somehow appropriate even for situations in which 
a human is making the decision. 

Sequential hypothesis testing and quickest detection prob- 
lems have been vastly studied ifTTl . |4). The SPRT for binary 
decision making was introduced by Wald in |2 1 1. and was 
extended by Armitage to multiple hypothesis testing in (TJ. 
The Armitage test, unlike the SPRT, is not necessarily op- 
timal ll24l . Various other tests for multiple hypothesis test- 
ing have been developed throughout the years; a survey is 
presented in lfl"8l . Designing hypothesis tests, i.e., choosing 
thresholds to decide within a given expected number of 
iterations, through any of the procedures in |fl"8l is infeasible as 
none of them provides any results on the expected sample size. 
A sequential test for multiple hypothesis testing was developed 
in 0, iflOl . and AD . which provides with an asymptotic 
expression for the expected sample size. This sequential test is 
called the MSPRT and reduces to the SPRT in case of binary 
hypothesis. 

Recent years have witnessed a significant interest in the 
problem of sensor selection for optimal detection and estima- 
tion. Tay et al |2Q) discuss the problem of censoring sensors 
for decentralized binary detection. They assess the quality of 
sensor data by the Neyman-Pearson and a Bayesian binary 
hypothesis test and decide on which sensors should transmit 
their observation at that time instant. Gupta et al |[T3| focus on 
stochastic sensor selection and minimize the error covariance 
of a process estimation problem. Isler et al [fT4l propose 
geometric sensor selection schemes for error minimization 
in target detection. Debouk et al J9) formulate a Markovian 
decision problem to ascertain some property in a dynamical 
system, and choose sensors to minimize the associated cost. 
Wang et al ll22l design entropy-based sensor selection algo- 
rithms for target localization. Joshi et al lfT31 present a convex 
optimization-based heuristic to select multiple sensors for 
optimal parameter estimation. Bajovic et al J5] discuss sensor 
selection problems for Neyman-Pearson binary hypothesis 
testing in wireless sensor networks. 

A third and last set of references related to this paper 
are those on linear-fractional programming. Various iterative 
and cumbersome algorithms have been proposed to optimize 
linear-fractional functions (8), Q. In particular, for the prob- 
lem of minimizing the sum and the maximum of linear- 
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fractional functionals, some efficient iterative algorithms have 
been proposed, including the algorithms by Falk et al ifPU l and 
by Benson (6). 

In this paper, we analyze the problem of time-optimal se- 
quential decision making in the presence of multiple switching 
sensors and determine a sensor selection strategy to achieve 
the same. We consider a sensor network where all sensors are 
connected to a fusion center. The fusion center, at each instant, 
receives information from only one sensor. Such a situation 
arises when we have interfering sensors (e.g., sonar sensors), 
a fusion center with limited attention or information processing 
capabilities, or sensors with shared communication resources. 
The fusion center implements a sequential hypothesis test 
with the gathered information. We consider two such tests, 
namely, the SPRT and the MSPRT for binary and multiple 
hypothesis, respectively. First, we develop a version of the 
SPRT and the MSPRT where the sensor is randomly switched 
at each iteration, and determine the expected time that these 
tests require to obtain a decision within a given degree of 
accuracy. Second, we identify the set of sensors that minimize 
the expected decision time. We consider three different cost 
functions, namely, the conditioned decision time, the worst 
case decision time, and the average decision time. We show 
that the expected decision time, conditioned on a given hypoth- 
esis, using these sequential tests is a linear-fractional function 
defined on the probability simplex. We exploit the special 
structure of our domain (probability simplex), and the fact 
that our data is positive to tackle the problem of the sum and 
the maximum of linear-fractional functionals analytically. Our 
approach provides insights into the behavior of these functions. 
The major contributions of this paper are: 

i) We develop a version of the SPRT and the MSPRT 
where the sensor is selected randomly at each obser- 
vation. 

ii) We determine the asymptotic expressions for the thresh- 
olds and the expected sample size for these sequential 
tests. 

iii) We incorporate the processing time of the sensors into 
these models to determine the expected decision time. 

iv) We show that, to minimize the conditioned expected 
decision time, the optimal policy requires only one 
sensor to be observed. 

v) We show that, for a generic set of sensors and M 
underlying hypotheses, the optimal average decision 
time policy requires the fusion center to consider at most 
M sensors. 

vi) For the binary hypothesis case, we identify the optimal 
set of sensors in the worst case and the average decision 
time minimization problems. Moreover, we determine an 
optimal probability distribution for the sensor selection. 

vii) In the worst case and the average decision time mini- 
mization problems, we encounter the problem of min- 
imization of sum and maximum of linear-fractional 
functionals. We treat these problems analytically, and 
provide insight into their optimal solutions. 

The remainder of the paper is organized in following way. In 
Section [II] we present the problem setup. Some preliminaries 



are presented in Section UTU We develop the switching-sensor 
version of the SPRT and the MSPRT procedures in Section Hvl 
In Section [V] we formulate the optimization problems for 
time-optimal sensor selection, and determine their solution. 
We elucidate the results obtained through numerical examples 
in Section |VI] Our concluding remarks are in Section I VII I 

II. PROBLEM SETUP 

We consider a group of n > 1 agents (e.g., robots, sensors, 
or cameras), which take measurements and transmit them to a 
fusion center. We generically call these agents "sensors." We 
identify the fusion center with a person supervising the agents, 
and call it "the supervisor." 




Fig. 1. The agents A transmit their observation to the supervisor S, one at 
the time. The supervisor performs a sequential hypothesis test to decide on 
the underlying hypothesis. 

The goal of the supervisor is to decide, based on the 
measurements it receives, which of the M > 2 alternative 
hypotheses or "states of nature," Hk, k G {0, . . . , M — 1} is 
correct. For doing so, the supervisor uses sequential hypothesis 
tests, which we briefly review in the next section. 

We assume that only one sensor can transmit to the supervi- 
sor at each (discrete) time instant. Equivalently, the supervisor 
can process data from only one of the n agents at each time. 
Thus, at each time, the supervisor must decide which sensor 
should transmit its measurement. We are interested in finding 
the optimal sensor(s), which must be observed in order to 
minimize the decision time. 

We model the setup in the following way: 

i) Let s; € {1, . . . ,n} indicate which sensor transmits its 
measurement at time instant / € N. 

ii) Conditioned on the hypothesis H^, k £ {0, . . . , M — 1}, 
the probability that the measurement at sensor s is y, is 
denoted by fg{y). 

iii) The prior probability of the hypothesis H k , k £ 
{0, ...,M— 1}, being correct is ir^. 

iv) The measurement of sensor s at time I is y S[ . We assume 
that, conditioned on hypothesis Hk, y Sl is independent 
of y(s f ), for (I, s{) ^ (f,s r ). 

v) The time it takes for sensor s to transmit its measurement 
(or for the supervisor to process it) is T s > 0. 

vi) The supervisor chooses a sensor randomly at each time 
instant; the probability to choose sensor s is stationary 
and given by q s . 

vii) The supervisor uses the data collected to execute a 
sequential hypothesis test with the desired probability of 
incorrect decision, conditioned on hypothesis Hf., given 
by a k . 
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viii) We assume that there are no two sensors with identical 
conditioned probability distribution f k (y) and process- 
ing time T s . If there are such sensors, we club them 
together in a single node, and distribute the probability 
assigned to that node equally among them. 

IE. PRELIMINARIES 

A. Linear-fractional function 

Given parameters A G M. qxp , B G R ? , c G R p , and del, 
the function g : {z G R p | c T z + d > 0} -> R«, defined by 



c T .x + d' 

is called a linear-fractional function (8). A linear-fractional 
function is quasi-convex as well as quasi-concave. In partic- 
ular, if q = 1, then any scalar linear- fractional function g 
satisfies 



g(tx+ (1 -t)y) < msx{g{x),g{y)}, 
g(tx + (1 - t)y) > min{g(x), g(y)}, 

c T z + d>0}. 



(1) 



for all t G [0, 1] and x, y G {z G 



B. Kullback-Leibler distance 



Given two probability distributions functions f\ : R — > R>o 
and / 2 : R — > R>o, the Kullback-Leibler distance P : £ x x 
Z3 1 -> R is defined by 



Z>(/i,/ 2 )=E 



log 



AW 

/2PO 



Further, /2) > 0, and the equality holds if and only if 

fx = /2 almost everywhere. 

C. Sequential Probability Ratio Test 

The SPRT is a sequential binary hypothesis test that pro- 
vides us with two thresholds to decide on some hypothesis, 
opposed to classical hypothesis tests, where we have a single 
threshold. Consider two hypothesis Hq and Hi, with prior 
probabilities ttq and 7Ti, respectively. Given their conditional 
probability distribution functions f(y\Ho) = f°(y) and 
f(y\Hi) = f 1 (y), and repeated measurements {yi, y^, ■ ■ .}, 
with Aq defined by 



A = log 



7Tl 
7T0' 



(2) 



the SPRT provides us with two constants rjo and rji to decide 
on a hypothesis at each time instant I, in the following way: 



i) Compute the log likelihood ratio: A; := log ja 



N 



ii) Integrate evidence up to time N, i.e., A^r := J^Aj 

iii) Decide only if a threshold is crossed, i.e., 



1=0 



AiV > T/l, 

Aw < rjo, 
A N G ]t7o,t7i[ 



say iJi, 
say H , 

continue sampling. 



Given the probability of false alarm ¥(Hi\Hq) = ceo an d 
probability of missed detection ¥(Hq\Hi) = ct\, the Wald's 
thresholds ?/o and r/i are defined by 

7/o = log - ai , and 771= log- — — . (3) 
1 — a a 

The expected sample size N, for decision using SPRT is 
asymptotically given by 

(1 - a )r]o + aorji - A 



E [N\H ] 
V[N\Hi] 



(1 - ai)7?i 



and 



Ao 



(4) 



as —?/0) 7/1 — > 00. The asymptotic expected sample size expres- 
sions in equation © are valid for large thresholds. The use of 
these asymptotic expressions as approximate expected sample 
size is a standard approximation in the information theory lit- 
erature, and is known as Wald's approximation 0J, ifTTl . |fl9l . 

For given error probabilities, the SPRT is the optimal se- 
quential binary hypothesis test, if the sample size is considered 
as the cost function lfl9l . 

D. Multi-hypothesis Sequential Probability Ratio Test 

The MSPRT for multiple hypothesis testing was introduced 
in J5), and was further generalized in iflOl and IfTTl . It is 
described as follows. Given AI hypotheses with their prior 
probabilities TTk,k G {0, . . . , M — 1}, the posterior probability 
after N observations yi, I G {!,..., N} is given by 



p k N =¥(H = H k \ yi ,...,y N ) = 



N 



M-l 



3=1 



E ^ n p{yi) 



1=1 



where f k is the probability density function of the observation 
of the sensor, conditioned on hypothesis k. 

Before we state the MSPRT, for a given N, we define k by 



N 



k = argmax ttj II f 3 (y, 



1=1 



je{o,...,Af-i} 

The MSPRT at each sampling iteration / is defined as 



l otherwise, 



Pi > iq^:if or at l east one sa Y Hj., 



continue sampling, 



where the thresholds 7/fe, for given frequentist error probabili- 
ties (accept a given hypothesis wrongly) ah, k G {0, . . . , M — 
1}, are given by 

% = , (5) 

7Tfc7fc 

where jf. G ]0, 1[ is a constant function of f k (see Q). 

It can be shown [5| that the expected sample size of the 
MSPRT, conditioned on a hypothesis, satisfies 

- log 7/fe 



E[N\H k 



as max r\ k 
fee{o,...,M-i} 



0^ 



where S k = mm{V(f k ,P) \ j G {0, . . . , M - 1} \ {k}}. 

The MSPRT is an easily-implementable hypothesis test and 
is shown to be asymptotically optimal in 13], IflOl . 
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IV. Sequential hypothesis tests with switching 

SENSORS 

A. SPRT with switching sensors 

Consider the case when the fusion center collects data from 
n sensors. At each iteration the fusion center looks at one 
sensor chosen randomly with probability q s , s G {1, . . . ,rt}. 
The fusion center performs SPRT with the collected data. 
We define this procedure as SPRT with switching sensors. If 
we assume that sensor si is observed at iteration Z, and the 
observed value is y Sl , then SPRT with switching sensors is 
described as following, with the thresholds 770 and 771 defined 
in equation ||3}, and Ao defined in equation (f2]i: 

i) Compute log likelihood ratio: 



A; := log 



my si y 



N 



ii) Integrate evidence up to time N, i.e., A/v := J^A;, 

1=0 

iii) Decide only if a threshold is crossed, i.e., 



A N > 771, 
A w < 770, 
Aw G ] 770 , r?i [ 



say Hi, 
say if , 

continue sampling. 



Lemma 1 (Expected sample size): For the SPRT with 
switching sensors described above, the expected sample size 
conditioned on a hypothesis is asymptotically given by: 



E[N\H ] 



E [AT I if 1 



(1 - Qo)?7 + q 77i - A 

n ■ 

E^/Mi) 

s=l 

aiVo + (1 - 0-1)111 - A 

n ' 
s=l 



and 



(6) 



as —770, ?7i -* °°- 

Proof: Similar to the proof of Theorem 3.2 in 

The expected sample size converges to the values in equa- 
tion © for large thresholds. From equation ([3J, it follows 
that large thresholds correspond to small error probabilities. 
In the remainder of the paper, we assume that the error 
probabilities are chosen small enough, so that the above 
asymptotic expression for sample size is close to the actual 
expected sample size. 

Lemma 2 (Expected decision time): Given the processing 
time of the sensors T 3 , s G {1, . . . , n}, the expected decision 
time of the SPRT with switching sensors T d , conditioned on 
the hypothesis iffc, k G {0, 1}, is 



E[T d \H k ] 



E 1sT s 

s=i = q ■ t 

8=1 



, for each k G {0, 1}, (7) 



where T, I k € R>o> ar e constant vectors for each k £ {0, 1}. 

Proof: The decision time using SPRT with switching 
sensors is the sum of sensor's processing time at each iter- 
ation. We observe that the number of iterations in SPRT and 



the processing time of sensors are independent. Hence, the 
expected value of the decision time T4 is 

E[T d \H k ]=E[N\H k ]E[T], for each k e {0, 1}. (8) 

By the definition of expected value, 

n 

E[T} = Y / QsT s . (9) 

s=l 

From equations ©, ([8]), and (0 it follows that 

n 

E <lsT s 

E [T d \H k ] = i=i = — T T , for each k g {0, 1}, 



q- I k 



s=l 



where i* £ R>o is a constant, for each k G {0, 1}, and s 6 
{l,...,n}. M 

B. MSPRT with switching sensors 

We call the MSPRT with the data collected from n sensors 
while observing only one sensor at a time as the MSPRT with 
switching sensors. The one sensor to be observed at each time 
is determined through a randomized policy, and the probability 
of choosing sensor s is stationary and given by q s . Assume 
that the sensor s; G {l,...,n} is chosen at time instant 
I, and the prior probabilities of the hypothesis are given by 
7Tfe, k G {0, . . . , M — 1}, then the posterior probability after 
N observations yi, I G {!,..., N} is given by 



TV 



p k N =¥(H k \y u ...,y N ) 



1=1 



M-l / N . 

E M nfM 

J=0 



1=1 



Before we state the MSPRT with switching sensors, for a 
given N, we define k by 



N 



k= argmax 7T fc II /* (j/j). 

1=1 



fce{o,...,M-i} 



For the thresholds rjh, k G {0, . . . , M — 1}, defined in 
equation (0, the MSPRT with switching sensors at each 
sampling iteration N is defined by 

{Pn > TT^:' f° r at ^ east one sa y ^fc) 

otherwise, continue sampling. 

Before we state the results on asymptotic sample size and 
expected decision time, we introduce the following notation. 
For a given hypothesis H k , and a sensor s, we define jf k ^ 
by 

argmin V(f*,f s ). 



J(s,k) 



je{o,...,M-i} 



We also define 8 V : A„_i x (C 1 ) n x (/I 1 )" -> R by 

n 
s=l 

where A„_i represents the probability simplex in K" 
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Lemma 3 (Expected sample size): Given thresholds r)j, j G 
{0, . . . , M — 1}, the sample size N required for decision 
satisfies 

E[N\H k ] 1 



log T)k 

-0. 



as max 

je{o M-i} 

Proof: The proof follows from Theorem 5.1 of J5] and 
the observation that 



N 



Lemma 4 (Expected decision time): Given the processing 
time of the sensors T s ,s G {1, ...,n}, the expected deci- 
sion time Td conditioned on the hypothesis H k , for each 
k G {0, . . . , M — 1}, is given by 



E[T d \H k ] = 



log 1% 



k * — » n ■ 



q-T_ 

Jk ' 



(10) 



are constants. 



where T, I k G 

Proof: Similar to the proof of Lemma [2] 

V. OPTIMAL SENSOR SELECTION 

In this section we consider sensor selection problems with 
the aim to minimize the expected decision time of a sequential 
hypothesis test with switching sensors. As exemplified in 
Lemma |U the problem features multiple conditioned decision 
times and, therefore, multiple distinct cost functions are of 
interest. In Scenario I below, we aim to minimize the decision 
time conditioned upon one specific hypothesis being true; in 
Scenarios II and III we will consider worst-case and average 
decision times. In all three scenarios the decision variables 
take values in the probability simplex. 

Minimizing decision time conditioned upon a specific hy- 
pothesis may be of interest when fast reaction is required 
in response to the specific hypothesis being indeed true. For 
example, in change detection problems one aims to quickly 
detect a change in a stochastic process; the CUSUM algorithm 
(also referred to as Page's test) |[T6l is widely used in such 
problems. It is known [4| that, with fixed threshold, the 
CUSUM algorithm for quickest change detection is equivalent 
to an SPRT on the observations taken after the change has 
occurred. We consider the minimization problem for a single 
conditioned decision time in Scenario I below and we show 
that, in this case, observing the best sensor each time is the 
optimal strategy. 

In general, no specific hypothesis might play a special role 
in the problem and, therefore, it is of interest to simultane- 
ously minimize multiple decision times over the probability 
simplex. This is a multi-objective optimization problem, and 
may have Pareto-optimal solutions. We tackle this problem 
by constructing a single aggregate objective function. In the 
binary hypothesis case, we construct two single aggregate 
objective functions as the maximum and the average of the two 
conditioned decision times. These two functions are discussed 
in Scenario II and Scenario III respectively. In the multiple 



hypothesis setting, we consider the single aggregate objective 
function constructed as the average of the conditioned decision 
times. An analytical treatment of this function for M > 2, is 
difficult. We determine the optimal number of sensors to be 
observed, and direct the interested reader to some iterative 
algorithms to solve such optimization problems. This case is 
also considered under Scenario III. 

Before we pose the problem of optimal sensor selection, we 
introduce the following notation. We denote the probability 
simplex in R" by A„_i, and the vertices of the probability 
simplex A„_i by e.;, i G {l,...,n}. We refer to the line 
joining any two vertices of the simplex as an edge. Finally, 
we define g k : A n _i -> R, k G {0, . . . , M - 1}, by 

q-T 



9 k {q) 



q-I k ' 



A. Scenario I (Optimization of conditioned decision time): 

We consider the case when the supervisor is trying to 
detect a particular hypothesis, irrespective of the present hy- 
pothesis. The corresponding optimization problem for a fixed 
k G {0, . . . , M — 1} is posed in the following way: 

minimize q k (q) 
subject to q G A n _i. 

The solution to this minimization problem is given in the 
following theorem. 

Theorem 1 (Optimization of conditioned decision time): 
The solution to the minimization problem (TTTb is q* = e s *, 
where s* is given by 

T 

s* = argmin — , 

se{l,...,n} '.- 

and the minimum objective function is 



E[T*\H k 



(12) 



Proof: We notice that objective function is a linear- 
fractional function. In the following argument, we show that 
the minima occurs at one of the vertices of the simplex. 

We first notice that the probability simplex is the convex 
hull of the vertices, i.e., any point q in the probability simplex 
can be written as 



3=1 



a. 



1, and a s > 0. 



We invoke equation (fl]i, and observe that for some (3 G [0, 1] 
and for any s, r G {1, . . . , n} 

g k (f3e s + (1 - (3)e r ) > mm{g k (e s ), g k (e r )}, (13) 

which can be easily generalized to 

g k (q)> min g k (e s ), (14) 
ie{i,...,n} 

for any point q in the probability simplex A„_i. Hence, 
minima will occur at one of the vertices e s », where s* is 
given by 

T 

s* = argmin g k (e s ) = argmin 

s£{l,...,n} s£{l,...,n}*s 
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B. Scenario II (Optimization of the worst case decision time): 

For the binary hypothesis testing, we consider the multi- 
objective optimization problem of minimizing both decision 
times simultaneously. We construct single aggregate objective 
function by considering the maximum of the two objective 
functions. This turns out to be a worst case analysis, and the 
optimization problem for this case is posed in the following 
way: 



minimize max • 



subject to q E A„_i. 



(15) 



Before we move on to the solution of above minimization 
problem, we state the following results. 

Lemma 5 (Monotonicity of conditioned decision times): 
The functions g k , k G {0, . . . , M — 1} are monotone on 
the probability simplex A„_i, in the sense that given two 
points qi,q2 € A„_i, the function g k is monotonically 
non-increasing or monotonically non-decreasing along the 
line joining q\ and q 2 . 

Proof: Consider probability vectors qi,q 2 G A„_i. Any 
point q on line joining qi and q 2 can be written as q(t) = 
tqi + (1 - t)q 2 , t G ]0, 1[. We note that g k (q(t)) is given by: 



t( qi -T) + (l-t)(q 2 -T) 
t( qi -I k ) + (l-t)(q 2 -I k )- 



The derivative of g k along the line joining qi and q 2 is given 



by 



dt 



g k (q(t)) = {g k ( qi )~g k (q 2 )) 



( qi ■ I k )(q 2 ■ I k ) 



(t(«i ■/*) + (!-*)(«, -J*)) 2 



We note that the sign of the derivative of g k along the line 
joining two points qi,q 2 is fixed by the choice of q\ and 
q 2 . Hence, the function g k is monotone over the line joining 
qi and q 2 . Moreover, note that if g k {qi) ^ g k {q2), then g k 
is strictly monotone. Otherwise, g k is constant over the line 
joining qi and q 2 . ■ 

Lemma 6 (Location of min-max): Define g : A„_i — > R>o 
by g — max{g°, g 1 }. A minimum of g lies at the intersection 
of the graphs of g° and g 1 , or at some vertex of the probability 
simplex A„_i. 

Proof: Case 1: The graphs of g° and g 1 do not intersect 
at any point in the simplex A„_i. 

In this case, one of the functions g° and g 1 is an upper 
bound to the other function at every point in the probability 
simplex A„_i. Hence, g = g k , for some k G {0, 1}, at every 
point in the probability simplex A„_i. From Theorem [T] we 



g°(q*) > g\q*). Also, g°(q) = g^q), and g°(q*) < g°(q) 
by assumption. 

We invoke Lemma|5] and notice that g° and g 1 can intersect 
at most once on a line. Moreover, we note that g°(q*) > 
g 1 (q*), hence, along the half-line from q through q*, g° > g 1 , 
that is, g = g°. Since g°(q*) < g°(q), g is decreasing along 
this half-line. Hence, g should achieve its minimum at the 
boundary of the simplex A n _i, which contradicts that q* is 
in the relative interior of the simplex A n _i. In summary, if 
a minimum of g lies in the relative interior of the probability 
simplex A„_i, then it lies at the intersection of the graphs of 
g° and g 1 . 

The same argument can be applied recursively to show that 
if a minimum lies at some point q^ on the boundary, then 
either g°(q') = g 1 ^) or the minimum lies at the vertex. ■ 

In the following arguments, let Q be the set of points in the 
simplex A„_i, where g° = g 1 , that is, 



= {qeA n ^\q-(I°-I 1 )=0}. 



(16) 



Also notice that the set Q is non empty if and only if I — I 1 
has at least one non-negative and one non-positive entry. If the 
set Q is empty, then it follows from Lemma|6]that the solution 
of optimization problem in equation ( fT3T > lies at some vertex of 
the probability simplex A n _i. Now we consider the case when 
Q is non empty. We assume that the sensors have been re- 
ordered such that the entries in 1° — I 1 are in ascending order. 
We further assume that, for 1° —I 1 , the first m entries, m < n, 
are non positive, and the remaining entries are positive. 

Lemma 7 (Intersection polytope): If the set Q defined in 
equation ( fT6l ) is non empty, then the polytope generated by 
the points in the set Q has vertices given by: 



Q = {q sr \8€{1, 
where for each i E { 1 , . . 

(/o_/l)_(/u_ 7 l), 

i - sr. 



. , to} and r <G {to + 1, . 
n} 

if i = s, 



if i = r, 
otherwise. 



■,n}}, 



(17) 



Proof: Any q G Q satisfies the following constraints 

n 

X> = 1 , a.G[o,i], (18) 

s=l 

n 

J2is(i° s -i 1 s )=o, (19) 

s=l 

Eliminating q n , using equation ( fT8l and equation ( TT9b . we get: 

" _1 (TO _ Tl\ _ (TO _ Tl\ 

^/3 s g s = l, where [3 S = [ " , n T l \,\ 'X (20) 



s=l 



(1° - 1 1 ) 



know that the minima of g k on the probability simplex A n _i The equation (f20ll defines a hyperplane, whose extreme points 

lie at some vertex of the probability simplex A„_i. in K™^ 1 are given by 

Case 2: The graphs of g° and g 1 intersect at a set Q in the ^ 
probability simplex A„_i, and let q be some point in the set q sn = — e s , i £ {1, . . . , n — 1}. 

Q. P° 

Suppose, a minimum of g occurs at some point q* E Note that for s G {1,...,to}, q STl G A„_ 1 . Hence, these 

relint(A n _i), and q* ^ Q, where relint( ) denotes the relative points define some vertices of the polytope generated by points 

interior. With out loss of generality, we can assume that in the set Q. Also note that the other vertices of the polytope 
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can be determined by the intersection of each pair of lines 
through q sn and q rn , and e s and e r , for s G {1, . . . , m}, and 
r G {to + 1, . . . , n — 1}. In particular, these vertices are given 
by q sr defined in equation ( fTTI i. 

Hence, all the vertices of the polytopes are defined by q sr , 
s G {1, . . . , to}, r G {to + 1, . . . , n}. Therefore, the set of 
vertices of the polygon generated by the points in the set Q 
is Q. m 

Before we state the solution to the optimization problem 
< fT~5T >, we define the following: 

( S >*)G argmin & = f ff% = ( f ~ ^ , and 



s T s r 



se{l,-,m} 



.9two-sensors , / ) 

We also define 



(J r °. -I},)T S , -(7° - £)T r . 



J 1 ./". - J ,/ 1 , 



argmin max 



»e{i n) 

.9one-sensor(>*) = max 



jo"' jT 



and 



T * T * 



Theorem 2 (Worst case optimization): For the optimization 
problem (fT5t , an optimal probability vector is given by: 

* J &w* ? if £/one-sensor(^ ) ^ 5Wo-sensors j 7* ) ; 



if g 



one-sensor 



(iV*) > 3two-sensors(s*, T*), 



and the minimum value of the function is given by: 



min {g 



one-sensor 



(w*),g t 



wo-sensors 



( s *,r*)}. 



Proof: We invoke Lemma [6] and note that a minimum 
should lie at some vertex of the simplex A„_i, or at some 
point in the set Q. Note that g° = g 1 on the set Q, hence the 
problem of minimizing max{<7°, g 1 } reduces to minimizing 
g° on the set Q. From Theorem [1] we know that g° achieves 
the minima at some extreme point of the feasible region. From 
Lemma |7] we know that the vertices of the polytope generated 
by points in set Q are given by set Q. We further note that 
5two-sensors(s,?") and gone-sensor (w) are the value of objective 
function at the points in the set Q and the vertices of the 
probability simplex A„_i respectively, which completes the 
proof. ■ 

C. Scenario III (Optimization of the average decision time): 

For the multi-objective optimization problem of minimizing 
all the decision times simultaneously on the simplex, we for- 
mulate the single aggregate objective function as the average 
of these decision times. The resulting optimization problem, 
for M > 2, is posed in the following way: 



minimize 
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M- 



(21) 



subject to q G A„_i. 



In the following discussion we assume n > M, unless 
otherwise stated. We analyze the optimization problem in 
equation d2Tb as follows: 



Lemma 8 (Non-vanishing Jacobian): The objective func- 
tion in optimization problem in equation ( 1211 ) has no critical 



tM-I 



^>0 



point on A n _i if the vectors T,I ,. 
linearly independent. 

Proof: The Jacobian of the objective function in the 
optimization problem in equation (l2Tb is 



1 d 



M-l 



Mdq 
where T 

i> ■ A„_i 



fc=0 



M 



T 



tM—1 



• nx(M+l) 



and 



pAf+l 



is defined by 



M-l 

E 

k=0 



q-I k (q-I ) 2 



q-T 



For n > M, if the vectors T, 1°, . . . , J A/_1 are linearly 
independent, then V is full rank. Further, the entries of i/j 
are non-zero on the probability simplex A„_i. Hence, the 
Jacobian does not vanish anywhere on the probability simplex 
A n _i. ■ 

Lemma 9 (Case of Independent Information): For M — 2, 
if 1° and I 1 are linearly independent, and T = aol + ail 1 , 
for some ao, ai G R, then the following statements hold: 

i) if ao and a\ have opposite signs, then g° + g 1 has no 
critical point on the simplex A n _i, and 

ii) for ao, cn > 0, + has a critical point on the simplex 
A n _i if and only if there exists v G A„_i perpendicular 
to the vector ^/oqI — y^ajl 1 . 

Proof: We notice that the Jacobian of g° + g 1 satisfies 



(g./ ) 2 (g./ 1 ) 2 |(3°+ 5 1 ) 



= T((g./ )(g./ 1 ) 2 + (g./ 1 )(g./ ) 2 ) 
-/ (g.T)(g./ 1 ) 2 -/ 1 (9-T)(g-/ ) 2 . 

Substituting T = oq! + ail 1 , equation (l22l becomes 

= (a (g • I ) 2 - ai (q • I 1 ) 2 ) ((g • I 1 )/ - (g • I )/ 1 ) 
Since 1°, and I 1 are linearly independent, we have 

d 



(22) 



dq 



(9 U 







1\2 



a (g ■ I ) 2 - ai(q ■ I 1 ) 



0. 



Hence, g° + g 1 has a critical point on the simplex A„_i if 
and only if 

a {q-I°) 2 ^a^q-I 1 ) 2 . (23) 

Notice that, if ao, and ai have opposite signs, then equation 
( l23l ) can not be satisfied for any q G A„_i, and hence, .g° + ,g 1 
has no critical point on the simplex A„_i. 
If ao, ai > 0, then equation (l23l leads to 



g-(Va^/°-VaT/ 1 ) = 0. 

Therefore, g° + g 1 has a critical point on the simplex A„_i if 
and only if there exists v G A„_i perpendicular to the vector 
v /a^/° - y/a^I 1 . M 
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Lemma 10 (Optimal number of sensors): For n > M, if 
each (M + 1) x (M + 1) submatrix of the matrix 

tM-1 1 r- ™rix(M+l) 



[ T -7° 



is full rank, then the following statements hold: 

i) every solution of the optimization problem (l2TT i lies on 
the probability simplex Aa/-i C A n _i; and 

ii) every time-optimal policy requires at most M sensors 
to be observed. 

Proof: From Lemma [8] we know that if T, 7°, . . . , 
are linearly independent, then the Jacobian of the objective 
function in equation ( f2TT > does not vanish anywhere on the 
simplex A n _i. Hence, a minimum lies at some simplex A„_2, 
which is the boundary of the simplex A„_i. Notice that, if n > 
M and the condition in the lemma holds, then the projections 
of T,I°,...,I 1 on the simplex A„_2 are also linearly 
independent, and the argument repeats. Hence, a minimum 
lies at some simplex Ajf-i, which implies that optimal policy 
requires at most M sensors to be observed. ■ 
Lemma 11 (Optimization on an edge): Given two vertices 
e s and e r , s ^ r, of the probability simplex A n _i, then for 
the objective function in the problem (fJU with AI = 2, the 
following statements hold: 

i) if g°(e s ) < g°(e r ), and g 1 (e s ) < g 1 {e r ), then the 
minima, along the edge joining e s and e r , lies at e s , 
and optimal value is given by ^(g°(e s ) + g 1 (e s )); and 

ii) if g°(e s ) > g°(e r ), and g 1 {e s ) < g 1 (e r ), or vice versa, 
then the minima, along the edge joining e s and e r , lies 
at the point q* = (1 — t*)e s + t*e r , where 

t* = J—G]0,l[, 



I + p 

I?y/T a I} - T r Il - lly/T r P a - TJQ 

I I y/T r I° - Tjo - 1° y/TJ} - T r Il 
and the optimal value is given by 

ksV)+sV)) 



>o, 




Proof: We observe from Lemma [5] that both g°, and g 1 
are monotonically non-increasing or non-decreasing along any 
line. Hence, if g°{e s ) < g°(e r ), and g 1 (e s ) < g 1 (e r ), then the 
minima should lie at e s . This concludes the proof of the first 
statement. We now establish the second statement. We note 
that any point on the line segment connecting e s and e r can 
be written as q(t) = (1 — t)e a + te r . The value of g° + g 1 at 
q is 

nw \s (l-t)T s + tT r (l-t)T s + tT r 

Differentiating with respect to t, we get 



[Or —TT° 

1 s ± r s 1 r 



± s X T ■* 8 X T 



(7° + iW-4 )) 2 {Il+m-Il)f 



(24) 



Notice that the two terms in equation ( l24b have opposite 
sign. Setting the derivative to zero, and choosing the value of 
t in [0, 1], we get t* = j^, where p is as defined in the 
statement of the theorem. The optimal value of the function 
can be obtained, by substituting t = t* in the expression for 

i(5°(^))+.9 1 (9(i)))- ■ 

Theorem 3 (Optimization of average decision time): For 
the optimization problem in equation d2Tb with M = 2, the 
following statements hold: 

i) If 7°, I 1 are linearly dependent, then the solution lies at 
some vertex of the simplex A„_i. 

ii) If 7° and 7 1 are linearly independent, and T = aol + 
ail 1 , accti G M, then the following statements hold: 

a) If ao and oi\ have opposite signs, then the optimal 
solution lies at some edge of the simplex A„_i. 

b) If ao, a.\ > 0, then the optimal solution may lie in 
the interior of the simplex A„_i. 

iii) If every 3x3 sub-matrix of the matrix \T 7° 7 1 ] G 
K" x3 is full rank, then a minimum lies at an edge of 
the simplex A„_x. 

Proof: We start by establishing the first statement. Since, 
7° and 7 1 are linearly dependent, there exists a 7 > such 
that 7° = 77 1 . For 7° = 77 1 , we have g° + g l = (1 + j)g°. 
Hence, the minima of g° + g 1 lies at the same point where 
g° achieves the minima. From Theorem Q] it follows that g° 
achieves the minima at some vertex of the simplex A„_x. 

To prove the second statement, we note that from Lemma [9] 
it follows that if ao, and a\ have opposite signs, then the 
Jacobian of g° + g 1 does not vanish anywhere on the simplex 
A„_i. Hence, the minima lies at the boundary of the simplex. 
Notice that the boundary, of the simplex A„_x, are n simplices 
A„ 2- Notice that the argument repeats till n > 2. Hence, the 
optima lie on one of the (™) simplices Ai, which are the edges 
of the original simplex. Moreover, we note that from Lemma|9] 
it follows that if ao,a\ > 0, then we can not guarantee the 
number of optimal set of sensors. This concludes the proof of 
the second statement. 

To prove the last statement, we note that it follows im- 
mediately from Lemma \TU\ that a solution of the optimization 
problem in equation (l2T~b would lie at some simplex Ai, which 
is an edge of the original simplex. ■ 

Note that, we have shown that, for M = 2 and a generic 
set of sensors, the solution of the optimization problem in 
equation (|2TT i lies at an edge of the simplex A„_i. The optimal 
value of the objective function on a given edge was determined 
in Lemma [TT] Hence, an optimal solution of this problem can 
be determined by a comparison of the optimal values at each 
edge. 

For the multiple hypothesis case, we have determined 
the time-optimal number of the sensors to be observed in 
Lemma [TO] In order to identify these sensors, one needs to 
solve the optimization problem in equation d2Tl i. We notice 
that the objective function in this optimization problem is non- 
convex, and is hard to tackle analytically for M > 2. Interested 
reader may refer to some efficient iterative algorithms in 
linear-fractional programming literature (e.g., J6j) to solve 
these problems. 
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VI. NUMERICAL EXAMPLES 

We consider four sensors connected to a fusion center. 
We assume that the sensors take binary measurements. The 
probabilities of their measurement being zero, under two 
hypotheses, and their processing times are given in the Table U 



TABLE I 

Conditional probabilities of measurement being zero 



Sensor 


Probability(O) 


Processing Time 




Hypothesis 


Hypothesis 1 




1 


0.4076 


0.5313 


0.6881 


2 


0.8200 


0.3251 


3.1960 


3 


0.7184 


0.1056 


5.3086 


4 


0.9686 


0.6110 


6.5445 



We performed Monte-Carlo simulations with the mis- 
detection and false-alarm probabilities fixed at 10~ 3 , and 
computed expected decision times. In Table [II] the numerical 
expected decision times are compared with the decision times 
obtained analytically in equation ([7]). The difference in the 
numerical and the analytical decision times is explained by 
the Wald's asymptotic approximations. 

TABLE II 

Decision times for various sensor selection probabilities 



Sensor selection 




Expected Decision Time 




probability 












Hypothesis 


Hypothesis 1 




Analytical 


Numerical 


Analytical 


Numerical 


[1.0,0,0] 


154.39 


156.96 


152.82 


157.26 


[0.3768,0,0,0.6232] 


124.35 


129.36 


66.99 


76.24 


[0.25,0.25,0.25,0.25] 


55.04 


59.62 


50.43 


55.73 



In the Table [TTT1 we compare optimal policies in each 
Scenario I, II, and III with the policy when each sensor 
is chosen uniformly. It is observed that the optimal policy 
improves the expected decision time significantly over the 
uniform policy. 



TABLE III 

Comparison of the uniform and optimal policies 



Scenario 


Uniform policy 


Optimal policy 




Objective 
function 


Optimal probability 
vector 


Optimal objective 
function 


I 


59.62 


[0,0,1,0] 


41.48 


II 


55.73 


[0,0,1,0] 


45.24 


III 


57.68 


[0,0.4788.0.5212,0] 


43.22 



We performed another set of simulations for the multi- 
hypothesis case. We considered a ternary detection problem, 
where the underlying signal x = 0, 1,2 needs to be detected 
from the available noisy data. We considered a set of four 
sensors and their conditional probability distribution is given 
in Tables [IV] and [V] The processing time of the sensors were 
chosen to be the same as in Table [I] 

The set of optimal sensors were determined for this set 
of data. Monte-Carlo simulations were performed with the 
thresholds rjk, k 6 {0, . . . , M — 1} set at 10~ 6 . A comparison 
of the uniform sensor selection policy and an optimal sensor 



TABLE IV 

Conditional probabilities of measurement being zero 



Sensor 


Probability(O) 




Hypothesis 


Hypothesis 1 


Hypothesis 2 


1 


0.4218 


0.2106 


0.2769 


2 


0.9157 


0.0415 


0.3025 


3 


0.7922 


0.1814 


0.0971 


4 


0.9595 


0.0193 


0.0061 



TABLE V 

Conditional probabilities of measurement being one 



Sensor 


Probability(l) 




Hypothesis 


Hypothesis 1 


Hypothesis 2 


1 


0.1991 


0.6787 


0.2207 


2 


0.0813 


0.7577 


0.0462 


3 


0.0313 


0.7431 


0.0449 


4 


0.0027 


0.5884 


0.1705 



selection policy is presented in Table IVII Again, the significant 
difference between the average decision time in the uniform 
and the optimal policy is evident. 



TABLE VI 

Comparisons of the uniform and an optimal policy 



Policy 


Selection Probability 


Average Decision Time 




q* 


Analytical 


Numerical 


Optimal 


[0, 0.9876, 0, 0.0124] 


38.57 


42.76 


Uniform 


[0.25, 0.25, 0.25, 0.25] 


54.72 


54.14 



We note that the optimal results, we obtained, may only 
be sub-optimal because of the asymptotic approximations 
in equations (0 and (0. We further note that, for small 
error probabilities and large sample sizes, these asymptotic 
approximations yield fairly accurate results 13, and in fact, 
this is the regime in which it is of interest to minimize the 
expected decision time. Therefore, for all practical purposes 
the obtained optimal scheme is very close to the actual optimal 
scheme. 

VII. CONCLUSIONS 

In this paper, we considered the problem of sequential 
decision making. We developed versions SPRT and MSPRT 
where the sensor switches at each observation. We used these 
sequential procedures to decide reliably. We found out the set 
of optimal sensors to decide in minimum time. For the binary 
hypothesis case, three performance metrics were considered 
and it was found that for a generic set of sensors at most 
two sensors are optimal. Further, it was shown that for M 
underlying hypotheses, and a generic set of sensors, an optimal 
policy requires at most M sensors to be observed. A procedure 
for identification of the optimal sensor was developed. In the 
binary hypothesis case, the computational complexity of the 
procedure for the three scenarios, namely, the conditioned 
decision time, the worst case decision time, and the average 
decision time, was 0(n), 0(n 2 ), and 0(n 2 ), respectively. 

Many further extensions to the results presented here are 
possible. First, the time-optimal scheme may not be energy 
optimal. In particular, the time optimal set of sensors may be 



10 



the most distant sensors from the fusion center. Given that the 
power to transmit the signal to the fusion center is proportional 
to the distance from the fusion center, the time-optimal scheme 
is no where close to the energy optimal scheme. This trade off 
can be taken care of by adding a term proportional to distance 
in the objective function. 

When we choose only one or two sensors every time, issues 
of robustness do arise. In case of sensor failure, we need to 
determine the next best sensor to switch. A list of sensors, 
with increasing decision time, could be prepared beforehand 
and in case of sensor failure, the fusion center should switch 
to the next best set of sensors. 
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